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A regular realizability (RR) problem is testing nonemptiness of intersection of 
some fixed language (filter) with a regular language. We show that RR problems 
are universal in the following sense. For any language L there exists RR problem 
equivalent to L under disjunctive reductions on nondeterministic log space. 

We conclude from this result an existence of complete problems under polyno- 
mial reductions for many complexity classes including all classes of the polyno- 
mial hierarchy. 

Motivation of this work is to find out a specific class of algorithmic problems that 
represents in a unified way complexities of all known complexity classes (there are 
hundreds of them now). 

A typical algorithmic problem is recognition problem for a language. But in the 
most interesting cases an input is structured: it is a graph, function description etc. Our 
main goal is to choose a structure of an input to satisfy two (somewhat contradictory) 
requirements: a specific class of problems should be wide enough and it should be 
useful. The latter requirement reflects a hope that analysis of a specific problem might 
be easier than a general case. 

Here we consider regular realizability problems in this context. Let L be a language 
(we refer it further as a filter). The regular realizability problem with this filter is a 
question about realizability of regular properties on words in L. More exactly, the 
language RR(i) consists of descriptions of regular languages R such that Rn L 7^ 0. 

What are possible complexities of RR problems? In this paper we obtain a partial 
answer on this question. It appears that RR problems are universal: for any other 
problem there exists an equivalent RR problem. 

To make exact statements we need to fix a format for descriptions of regular lan- 
guages and an equivalence relation. 

We represent a regular language R by a deterministic finite automaton (DFA) A 
recognizing the language R (and denote this fact as R = L(A)). DFAs are described 
in natural way by transition tables. Details of the format used can be found in (TJ. 
Important features for this work are: (i) each binary word w is a description of some 
DFA A(w) and (ii) testing membership for a regular language L(w) = L(A(w)) can 
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be done using deterministic log space. So formal definition of the language RR(L) 
corresponding to RR problem with a filter L is 

to G RR(L) » in L(A(w)) ^ 0. 

Equivalence relations considered here are induced by algorithmic reductions. For 
any language X an RR-representative of X (under reductions of some type) is a lan- 
guage L such that RR(X) is equivalent to X under reductions of this type. 

The most natural reductions in the context of regular realizability are m-reductions 
using log space (^^-reductions). There exist RR problems complete under log space 
reductions for complexity classes such as LOG, NLOG, P, NP, PSPACE, EXP, £ 1; 
see EED. 

We can prove universality result mentioned above for stronger reductions: disjunc- 
tive reductions using nondeterministic log space (definition see below in Section [TJ. 
In the proof we reduce an arbitrary language X to RR language by a mono reduction: 
a reduction of special kind that sends a word to to a description of DFA accepting 1- 
element set {f(w)}, where / is an injective map. Disjunctive nlog space reductions 
appear in an attempt to invert a monoreduction (see Section 0. 

For many cases disjunctive nlog space reductions are weaker than polynomial re- 
ductions. In this way we extend a list of classes having complete RR problems under 
polynomial reductions. It will be shown in Section [2] that there exist RR complete 
problems for each class of the polynomial hierarchy. Note that it is a rather surprising 
even for the class co-NP: RR problem is formulated by existential quantifier (an exis- 
tence of an accepting path possessing specific properties) and there is no direct way to 
convert it into universal quantifier. 

We also present two other universality results. 

In Section[3]we show that RR problems are universal for promise problems. In this 
case inversion of a monoreduction is much simpler. But it requires FNL reductions 
too. It seems that using nondeterministic log space is unavoidable (see Remark [3] in 
Section©. 

In Section [5] we extend universality to the classes of generalized nondeterminism 
introduced in Technically it suffices to prove universality of RR problems with 
prefix closed filters. The proofs in this section are similar to the proofs in Section|4]but 
they involve more technical details. 

Thus RR problems are universal. They represent a huge variety of complexities. 
Are they useful? The proof suggests a negative answer. Reductions used in the proof 
cut off almost all properties of regular languages and put 'the hard part' of a problem 
into instances corresponding to finite languages. Of course, nlog reductions say nothing 
about languages inside NLOG. 

1 What reductions are used in universality results 

Equivalence relations used here are defined using reductions. Languages are equivalent 
(notation A ~ E) if they are reduced to each other under reductions of some type. 

We recall some basic definitions concerning algorithmic reductions. Let C be a 
function class. A language A is reduced to a language B under m-reductions by func- 
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tions from the class (notation if there exists a function / G C such that x € A iff 
f(x) € -B. If the class C contains the identity map and is closed under compositions 
then m-reduction relation is a preorder, i.e. transitive and reflexive relation. 

The most known reductions of this type are polynomial reductions (see, e.g. (4)). 

In this paper we need weaker m-reductions. The corresponding functions are com- 
puted by deterministic Turing machines using space logarithmically bounded w.r.t. the 
input length. We denote these reductions by ^|° s . More exactly, the machine has the 
read only input tape, the work tape of size 0(log n), where n is an input size and write 
only output tape. It easy to check that the relation ^J^ g is transitive. Note that the 
size of f(x) is polynomially bounded by the size of x (the number of configurations 
is polynomially bounded). So, the log space algorithm for a composition f(g(x)) uses 
a subroutine to compute zth bit of g (x) to simulate the input g(x) for the algorithm 
computing /. More details can be found in textbooks on complexity theory, say, |H][5). 

The second important type of reductions is Turing reductions. A language A is Tur- 
ing reducible to a language B if there exists an algorithm recognizing A that uses an 
oracle B. There are several restricted forms of Turing reductions. In truth-table reduc- 
tions a reducing algorithm generates a list of oracle queries qi , . . . , q s and a Boolean 
function a depending on s arguments. Then algorithm asks all queries and outputs 
a(Xs(<?i), • • . ,Xs(<? s )). 

Note that for log space Turing reductions and truth-table reductions are equiva- 
lent 0. 

We use in the main result a weaker form of truth-table reductions — disjunctive 
reductions (notation ^ dtt ). In this case the function a is disjunction. 

Disjunctive reductions can be expressed via 771-reductions in the following way. 
Define a language Seq(AT) as a collection of words in the form 

#£l#£2#---£n#, 

where Xi £ X for some i, # is a delimiter (an additional symbol that do not belong to 
the alphabet of the language X). The following statement is clear from definition. 

Proposition 1. A ^ dtt B iff A < m Seq(S). 

Words from Seq(AT) can be identified with finite sequences of words from X in 
natural way. Hereinafter we assume this correspondence. 

Definition 1. A class of languages is normal if it is closed under the map X i-» Seq(X) 
and s^f-reductions: if X e C and Y s^° s X then Seq(X) e C and Y G C. 

Definition 2. A language X is ^ m -normal if X ~ Seq(X) under ^ m -reductions. 
If a function class is not indicated we assume log space reductions. 

Proposition 2. Seq(X) is normal for any X. 

Proof. It is obvious that Y ^J° s Seq(F) for any Y. Thus we need to prove that 
Seq(Seq(X)) <£s Seq(X). Note that 

Seq(Seq(X)) <° g t X. 



3 



Indeed, a disjunction of disjunctions is a disjunction. So the reducing algorithm form 
a query list consisting of all elements of all elements of an input sequence. 

To complete a proof apply Proposition [I] □ 

Lemma 1. Let Seq(X) <£ s Y, Y <J° g Seq(X). Then Y is normal. 

Proof. We denote reducing maps by / : (x\, . . . , Xt) i-> y (for the first reduction) and 
q '. y i — y (x% , . . . , Xt) (for the second one). 

Now we reduce Seq(F) to Y. Compute sequences g(yi) for all elements of 
the sequence (yi, . . . ,y s ) and merge all of them into a sequence h(y) consisting of 
words from the language X. Apply to this sequence the map /. It is a reduction in 
question. Correctness stems from the fact that y 6 Scq(F) iff h(y) € Seq(X) iff 

/(%)) e Y. □ 

Corollary 1. If a normal class contains m-complete languages then all complete lan- 
guages in this class are normal. 

Now we discuss a choice of a function class for disjunctive reductions. 

It is natural that we prefer the weakest possible class. Log space seems to be un- 
sufficient (see Remark[3]below). 

Next step is to use nondeterministic space. The corresponding class is denoted by 
FNL. Machines computing FNL functions use a logarithmically bounded work tape 
and an unbounded oracle tape which is one way and write only. An oracle puts its 
answer on the oracle tape and overwrites a query. More details on the class FNL and 
its companions can be found in |7|. In particular, the composition lemma holds for the 
class FNL and the size of output is polynomially bounded by the input size. 

We denote FNL disjunctive reductions by ^™ L - The FNL m-reduction is denoted 

by^ NL . 

It is clear from definition that -reductions are stronger then log space reduc- 
tions and are weaker then disjunctive reductions in polynomial time. So, we have the 
following facts. 

Proposition 3. If A B then A B. 

Proposition 4. If A s$™ L B then A <^ tt B. Moreover, if B is normal then A <™ L B. 

Remark 1. The universality is much easier to prove for exponential time reductions. 
But these reductions do not say anything about the most interesting realm of comple- 
xities— PSPACE and below. 

For polynomial time reductions we need the same constructions that are used in our 
proof below. The algorithmic facts become easier. 

We use FNL reductions to cover P and below. Note that as shown in 1 8 1 basic 
counting log space classes are closed under <™ L -reductions. 



2 Examples of normal classes 

We will apply Corollary Q] and ^™ L -universality results to prove that a complexity 
class contains complete RR problems. Corollary Q] requires a class into consideration 
to be normal. 
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The most of known complexity classes are normal. We give several examples. In 
definitions of complexity classes and computational models we follow Arora and Barak 
book El. 

A straightforward algorithm for recognizing language Seq(X) is to check xi € X 
for all Xi from the input 

and take disjunction of the results. 

Let X e DSPACE(/(n)), where f(n) = O(logn). Then the above algorithm 
uses space 0(f(n)). Thus the class DSPACE(/(n)) is normal (it closed under ^j° s - 
reductions by obvious reasons). 

With small modifications the same argument is applied to nondeterministic space 
classes. Nondeterministic algorithm guesse i such that Xi g X and runs an algorithm 
recognizing X on an instance xi. So the classes NSPACE(/(n)), f(n) — f2(logn), 
are also normal. 

For time complexity classes closeness under ^J° g -reductions holds for the class P 
of polynomial time and more powerful classes. Running time of the above algorithm 
recognizing Seq(X) is upperbounded by 

f(n) = n+ max (T(m) + • • • + T(n m )), (1) 

n— niH \-n m 

where T(n) is a running time of an algorithm recognizing X. It is clear that T(n) — 
poly(n) implies T(n) = poly(n). So P is normal. For more powerful classes normal- 
ity of a time complexity class follows from closeness of time limitations under the map 
T(n) i — ^ T(n). There is a simple sufficient condition for closeness: if a function T(n) 
satisfies time limitations then nT(n) is also satisfies time limitations. Applying this 
observation we get normality for classes of quasipolynomial time, exponential time, 
simple exponential time and similar limitations. 

It is easy to see that under the same conditions nondeterministic time classes are 
also normal. 

The last example are classes of the polynomial hierarchy. 

Proposition 5. Each class of the polynomial hierarchy is normal. 

Proof. Closeness under ^|^ s -reductions is clear for each class of polynomial hierarchy. 
So it remains to show closeness under the map X M- Seq(X). 

Let ISeq(X) be a language consisting of #(i)#a;i . . . #x m # such that xi G X. 
Here the delimiter symbol # does not belong to the alphabet of the language X and 
(i) denotes binary representation of integer i. It is clear that X ISeq(X). Indeed, 
using log space one can extract ith list element. 

By definition of IScq(X) we have 

(w G Seq(X)) «*■ 3i(#iw E ISeq(X)). (2) 

Note that if X e T, p k then {#iw : #iw e ISeq(X)} is in E p k . Thus Seq(X) is in 
due to @. 

''Hereinafter a guess is a nondeterministic choice. 
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Suppose now that X 6 II?. In this case {#iw : #iu> £ ISeq(X)} is in II| and we 
have for some V € E? 1 and polynomial p(-) 

(#iw e ISeq(X)) & Vy(\y\ < A S V). 

So we need to interchange quantifiers in (fj). Due to this fact one can apply an equality 

3i € SVyW(x,i,y) = V f V W(x, i, y l ), 
ies ies 

where y l are new variables indexed by i e S. Thus Scq(X) 6 IT^. □ 

3 Monoreductions 

Without loss of generality we consider languages in binary alphabet. 
Let / be an injective map {0, 1}* -> {0, 1}*. 
A monoreduction is a map 

z'-^/Cx), (3) 

where A w is a description of minimal DFaE3 recognizing the 1-element language {w}. 
Note that the number of states in minimal DFA coincides with the number of Myhill - 
Nerode classes ifTTl . It is easy to verify that the number of Myhill - Nerode classes for 
the language {w} is just \w\ + 2 (all prefixes of the word w plus one) and the function 
w i y A w is computed on deterministic log space. 

Informally speaking, the map (01 assigns 'names' for all binary words in the form 
of automaton description. Injectivity condition implies that names of different words 
are different. 

If a map f(x) is log space computed then the corresponding monoreduction is also 
log space computed. It reduces a nonempty language C X C {0, 1}* to some RR 
problem 

X^RR(Y). (4) 
It is easy to see from definitions that © holds iff 

Ynf(X) = 0, YDf(X). (5) 

In other words, Y separates images of X and X and Y contains the image of X. 

Complexity of a reduction in the opposite direction depends heavily on Y . 

Take for example Y — f(X). It is not a good choice because complexity of the 
language RR(/(X)) varies in wide range w.r.t. complexity of the language X. It can 
be illustrated in the simplest case / = id. 

There exists a filter L such that (a) the membership problem for L is in the class of 
languages recognized by RAM in linear time; (b) RR(L) is complete for the class Si 
of recursively enumerable languages under m-reductions f3l . 

2 'We assume that construction of minimal automaton is fixed. For minimization algorithms see textbooks 
on formal languages, e.g. 1111 . 
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On the other hand for a X = {0"} a language RR(X) is in LOG. It was shown 
in [T) that LOG is ^|° s -reducible to any RR problem with infinite filter. 

Nevertheless, the reduction X ^J° s RR(X) can be inverted if we consider reduc- 
tions among promise problems. A promise problem is a problem of computing a par- 
tially defined predicate. In other words there are two languages L% and Lq such that 
L\ n Lq = 0. The question is to test membership w G L\ provided either w £ L\ or 

W E Lq. 

Promise problems have more expressive power than languages (which correspond 
to total predicates). For many complexity classes, say NP n co-NP or BPP, an exis- 
tence of complete languages in a class is an open problem. But there are simple and 
natural examples of complete promise problems for these classes. 

The question about complete RR promise problems is also much easier than the 
question about complete RR languages. 

Theorem 1. Any promise problem {L\, Lq) with L\ =^ is equivalent to RR promise 
problem under nlog space reductions. 

Proof. Define RR(ii : \R\ — 1) as RR problem with a promise = 1, i.e. a 

pair of languages 

{{A : L(A) n Li ^ A \L(A)\ = 1}, {A : L(A) (1L 1= 0A \L(A)\ = 1}). 

Then a promise problem (L\, Lq) is ^J^ s -reduced to the problem RR(Xi : \R\ = 1) 
by the map w H> A w . 

In the other direction we can prove a weaker reduction 

RR(L 1 :\R\ = 1)^ L (L 1 ,L Q ). (6) 

To construct a reduction © we need a procedure that find a word accepted by DFA A 
provided A recognizes a 1 -element language. This procedure is easily implemented in 
the class FNL: it nondeterministically guesses the word symbol by symbol maintaining 
the current state of the automaton reading the word. □ 

Remark 2. We call an automaton simple if it recognizes a 1 -element set. Testing 
simplicity can be done using nlog space. 

Testing algorithm again guesses a word accepted by the automaton symbol by sym- 
bol. But now it also checks a possibility to choose more than one symbol in such a way 
that after each choice the automaton can be moved to an accepting state by reading 
some sequence of symbols. Here we use a well-known equality co-NLOG = NLOG 

EEL 

Remark 3. Is it necessary to use NLOG-oracle in the reduction ((6)? The question is 
open but the negative answer is more plausible. In the non-uniform settings the class of 
unambiguous nondeterministic log space coincides with the class of nondeterministic 
log space ifTOl . It is quite natural to suggest that simplicity test do not belong to LOG 
if LOG 7^ NLOG. 
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Remark 4. The unique word accepted by DFA can be easily recovered from special 
forms of DFA description (say, description of minimal DFA accepting 1-element lan- 
guage). But it does not help to improve the reduction (|6) because we are interested 
in regular realizability problems (the answer depends on a language and should be the 
same for all automata recognizing the language). 

4 Universality of RR problems 

In the case of language reductions we are able to prove a weaker version of TheoremQ] 
using disjunctive reductions. 

Theorem 2. For any non-empty language X there exists a filter L such that 

X4°S RR(iK FNL x (?) 

In the proof of Theorem|2]we use the other extreme case for conditions ©. Namely, 
we choose Y = Xf = f{X). 

The idea behind the proof is to approximate inversion of a monoreduction as close 
as possible. The difficulty of inversion stems from the fact mentioned in Remark |4] 
instances of RR problem are all regular languages and RR problem might be hard for 
languages that are not 1-element languages constituting the image of a monoreduction. 
To overcome this difficulty we choose a filter satisfying conditions (0 in such a way 
that for most regular languages RR problem with the chosen filter is trivial. 

The first step toward implementation of this idea is to make RR problem trivial for 
all infinite languages. 

Definition 3. An infinite language is regularly immune if it does not contain any infinite 
regular language. 

Suppose that /({0, 1}*) is contained in some regularly immune language. Then 
any infinite regular language has a non-empty intersection with /({0, 1}*)- So for any 
infinite instance of RR(X/) the answer is positive. 

For finite instance of RR(X/) m-reducing algorithm should indicate a word that 
possibly belongs to X. It seems very hard for arbitrary X. 

Therefore the second step is to use disjunctive reductions. In this case reducing 
algorithm should just produce list of all words accepted by a specific automaton and 
this task is much easier in general case. 

Note that examples of regularly immune languages are numerous and easy to con- 
struct. Any such language can be used in reduction outlined above. But the cardinality 
of a finite language can be exponentially larger than the number of states in DFA rec- 
ognizing the language. So such a reduction requires an exponential time. 

Thus the next step is to choose a regularly immune set D possessing a special 
property: the cardinality of any regular language in D is polynomially upperbounded 
by the number of states in DFA recognizing the language. To guarantee a polynomial 
upper bound it is sufficient to require that D consists of squares only (see Remark|6). 
Below in Lemma[3]we give a linear bound. 
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Last but not least, all actions mentioned above should be efficiently implemented. 
We are going to construct a ^™ L -reduction. So we need FNL implementations. 

To realize the plan outlined above we start from specification of D. Let j3 be a 
morphism /3(0) = 01, ,3(1) = 10 and sq(-) be a map 

sq: x i y /3(x)l 2 |a;|2+3 /3(a;)l 2 |:l;|2+3 . (8) 

We choose D — Im(sq), i.e. an image of all binary words under the map sq. 

To prove that D is regularly immune one can use Parikh theorem iTTTI [LP . It says 
that the lengths of words from a regular (or even CFL) language form a semilinear set, 
i.e. a finite union of arithmetic progressions. 

Lemma 2. D is regularly immune. 

Proof. Lengths of words from D form the set {2n 2 + An + 10 : n £ N}. It is clear that 
intersection of this set with any arithmetic progression is finite. □ 

To give an upper bound on the cardinality of a regular language contained in D we 
make a couple of observations. 

By definition, D words are squares. Moreover, they are incomparable in the fol- 
lowing sense. 

Proposition 6. Letpq!,pq 2 G D. Thenpe /3({0, l}*)(e U {0, 1}). 

Proof. Note that w = sq[x) can be recovered from the prefix f3(x)ll: the first occur- 
rence of 11 starting at even position^] signals that the prefix f}(x) is completed and x is 
uniquely determined by this prefix. □ 

Proposition [6] will play an important role below. In particular, we will use the 
fact that common prefix of words from D does not contain 3 (easily follows from 
Proposition |6j. Just now we indicate an another simple corollary. 

Corollary 2. No word from D is a prefix of an another word from D. 

Remark 5. For an arbitrary language of squares Corollary 0does not hold: 0101 is a 
prefix of 010111010111. 

Now we are ready to prove a linear bound on the cardinality of a regular language 
L(A) that sits in D. 

Lemma 3. Let A be DFA such that L(A) c D. Then \L(A)\ < \Q\, where Q is the 
state set of A. 

Proof. It is sufficient to consider a minimal DFA B recognizing L(A). The states of B 
are in one-to-one correspondence with Myhill - Nerode classes. Consider two words 
ititti 7^ U2U2 in L(A). We prove that U1U2 $ L(A). It implies that U\ and are not 
equivalent and the number of Myhill - Nerode classes is not less than 

Suppose that u\U2 — vv £ L(A). Either ui is a prefix of v or v is a prefix of u\. 
In both cases we come to a contradiction with Proposition [6] both u\ and v contain 3 
as a sub word. □ 

3 'We enumerate positions in a word starting with 0. 
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Note also that lengths of words of a finite regular language L(A) are also linearly 
upperbounded by the number of states of DFA A. 

Proposition 7. Let A be DFA with the state set Q. If \L(A)\ < oo then for any 
w G L(A) we have \w\ < \Q\. 

This fact can be easily extracted from proofs of pumping lemma for regular lan- 
guages. An equivalent statement: L(A) is infinite iff there exists an accepting walk on 
the graph of the automaton A containing a cycle. 

Now we construct algorithms used in the proof of Theorem |2] 
Note that the binary representation of the length of word x has a size 0(log \x\). 
So arithmetic operations with numbers of this magnitude can be performed using log 
space. This fact is widely used below. 

Proposition 8. The membership problem for the language D is in LOG. The map sq 

and the inverse map sq -1 are log space computed. 

Proof. The algorithm that computes sq repeats twice the following procedure: compute 
f3(w), where w is the input word, add to the result l 2 , then add Os (the required number 
of zeroes is computed using log space). 

The membership test for D can be done in two stages. At first the algorithm checks 
that an input word is a square uu (easily performed using log space). If so, on the 
second stage the algorithm finds an occurrence of 11 in the word u starting at even 
position and splits u in a prefix and suffix removing this sub word 11. After that it 
verifies that the prefix is a code j3{x) of some x and the suffix is O' 1 +3 . 

Computing of the inverse map is performed in similar way. □ 

In nondeterministic algorithms working with inputs containing a description of 
DFA A we use procedures 'guessing a word' and 'guessing a square'. 

Guessing a word is a repeating guessing of symbols from the alphabet of A. These 
symbols form a word and in parallel algorithms will test some properties of this word. 
Tests should use log space and read a word in one-way. Note that DFA description 
is not shorter than the alphabet size as well as the cardinality of the state set. So, log 
space is sufficient to store a constant number of symbols and states. 

A simple example of a test is a check that a guessing word is accepted by A. This 
test stores a current state of the automaton and applies the transition function. It has 
been used above in the proof of Theorem [TJ 

Guessing a word is also used in algorithmic version of Proposition|7] 

Proposition 9. Infiniteness of a regular language is in NLOG provided a language is 
represented by DFA recognizing it. 

Proof. The algorithm guesses a state q of an input DFA A. After that it guesses a word 
w e L(A) such that the automaton visits q at least twice while reading w. The latter 
condition is easily verified on log space even if an input should be read in one-way. □ 

The next example will be used below. 



10 



Proposition 10. Halves of words from D, i.e. words in the form (3(x)l 2 0^ +3 , can be 
recognized using log space. Moreover, a recognizing algorithm can read input in one 
way. 

Proof. The algorithm runs in two stages. 

At the first stage it reads pairs of symbols and counts the number of pairs. If the pair 
00 is read then the algorithm stops with the negative answer. If the pair 11 is read then 
the algorithm starts the second stage. In other cases the counter of pairs is increased 
by 1. 

At the second stage the algorithm computes and stores a; 2 +3, where x is the counter 
value in the end of the first stage. Then the algorithm reads symbols and counts the 
number of symbols. If the symbol 1 is read then the algorithm stops with the negative 
answer. 

After reading the last symbol the algorithm compares the number of 0s read and 
the stored value x 2 + 3. The answer is positive if these values are equal. Otherwise, 
the answer is negative. 

Correctness of the algorithm is clear as well as logarithmic bound on space used. 

□ 

Guessing a square works similarly. It has two input parameters qi,q2 € Q(A). 
The procedure guesses a word w such that Sa(s, w) = qi, 8a{<11iW) = <72 (hence 
8a (s, ww) = q2). For this purpose it is sufficient to store two states — the current states 
of reading processes starting at the state s and at the state qi respectively. In parallel 
tests for the guessed word might be launched. They also should use log space and read 
a word in one-way. 

Guessing a square is used in the following algorithm that checks triviality of RR 
problem for a finite language. This check will be applied for RR problems with a filter 
containing D. Thus for instances A such that L (A) \ D ^ the answer of RR problem 
is positive. 

Lemma 4. Testing conjunction \L(A)\ < oo and L(A) C D is in NLOG. 

Proof. Testing finiteness of a regular language is in NLOG (Proposition |9). So the 
algorithm in question performs this check at the first stage. If L(A) is infinite the 
algorithm outputs the answer and finishes. So in the sequel we assume that L(A) is 
finite. 

Note that L(A) \ D ^ iff 

• either there exists w 6 L(A) of odd length; 

• either there exists w g L(A) of even length that is not a square; 

• or all words in L(A) are squares and there exists ww € L(A) \ D. 

The algorithm guesses among these three cases and tests a chosen property. 

Note that by Proposition [7] lengths of words from L(A) are not greater than input 
size (i.e. DFA description). So counting up to a word length can be done using log 
space. 



11 



In the first case the algorithm guesses a word w G L(A) of odd length (and check 
the parity of the length). 

In the second case it guesses position i, the length 2k and word w of the length 2k 
from L(A) such that ith and (k + i)th bits of w are different (and uses counters to find 
these bits). 

In the last case it guesses a square ww G L(A) \ D and checks that ww (£ D using 
Proposition [TOl Parameters are states q, t such that Sa(s,w) = q and Sa(q,w) — t. 
The algorithm guesses them before guessing a square. 

Correctness of the algorithm follows from the above observations. □ 

Our next goal is an algorithm that extracts words accepted by an automaton recog- 
nizing a finite language and outputs their images under inverse map sq _1 . We assume 
here that the regular language in question sits in D. 

Let A be DFA such that L(A) C D, Q be the state set, s be the initial state and 
Q a be the set of accepting states of A. We denote by 5 a the transition function 5: Q x 
A* — >• Q of A extended to the set of words in the input alphabet of A in natural way. 

Suppose that all words in L(A) are squares. Define a map 

li:L{A)^Q (9) 

by the rule: if w = uu G L(A) then fj,(w) — Sa{s, u). 
Map (0 is not injective. But its preimage is bounded. 

Proposition 11. If L(A) is a square language then for any accepting state t, integer k 
and a state q there exists at most one word uu G such that \u\ = k and 

Sa(s, uu) = t. 

Proof. By contradiction. If two different squares uu, vv satisfy these conditions then 
uv G L(A) but is not a square. □ 

Remark 6. Proposition [TTI implies a polynomial bound of the cardinality of L(A) 
consisting of squares. 

From propositions l7land [TTI we get a bound 

\L(A)\ = J2 \h-Hq)\< ' \Q\ ■ \Q\ = IQI 3 A 

Lemma 5. There exists an FNL algorithm that outputs the list of words ofsq^ 1 (L(A)) 
provided L(A) C D. 

Proof. It follows from Proposition [TTI that each word uu G L(A) is uniquely de- 
termined by states q G Q, t G Q a and integer k such that = q, \u\ = k, 
8a(s,uu) = t. PropositionQguarantees that k < \Q\. 

The algorithm in question tries all possible values q, t, k. For each triple it uses an 
NLOG oracle to check an existence of a word uu G D determined by the triple. 

An NLOG algorithm for this check guesses a square with parameters q, t. In paral- 
lel it verifies that the length of a guessed word u is k and uu G D. For the former test 
it uses a counter of guessed symbols. The latter is possible due to PropositionfTOl 
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If uu 6 D for a triple q, t, k does exist then the algorithm decodes x such that 
u = (3(x)l 2 0> x > +3 and outputs x. 

It can be done by guessing a word u combined with two simulations of reading 
the word u by the automaton A. One simulation starts from the initial state s and the 
another starts from q. These two simulations need to store two current states qi q 2 
respectively. 

Guessing a symbol is replaced in this procedure by two trials. For each possible 
variant a of the next symbol (there are two of them) the modified algorithm simulates 
reading the pair aa and for new values q[ and q' 2 asks an NLOG oracle about existence 
of a word w such that 6A(q[,w) — q, 5a{q.2t w ) = The corresponding NLOG 
algorithm is a modification of guessing a square. Due to NLOG = co-NLOG an 
negative answer can be also verified on nondeterministic log space. 

The word uu is unique for the triple q, t, k. So the oracle answers positively for 
exactly one value of a. This value is the next symbol of x and the algorithm outputs 
it. " □ 

Tying up loose ends we get the proof of the main result. 

Proof of Theorem\2\ Let us prove that 

X4°S RR(Xsq) <;FNL x (10) 

The first reduction does exist due to © and Proposition [8] 

Now we construct the second reduction. Let A be an instance of RR(X sq ). The 
reducing algorithm checks infiniteness of L(A) and L(A) \ D 7^ using Lemma|4] 
If R is infinite or it contains a word from D then the reducing algorithm forms a query 
list of length 1 containing a fixed element iqGI and output the list. 

Otherwise the algorithm outputs the list of all words from sq _1 (L(A)) applying 
the algorithm from Lemma [5] 

To prove correctness of the above reduction note that D is regularly immune. So 
any infinite regular language has a common word with X sq . If L(A) is finite and 
contains a word from D, then it has nonempty intersection with X sq . Finally, if a finite 
language L(A) is contained in D then R n X sq 7^ iff the output list contains a word 
from X. □ 

Taking into account Statement|5]we get the following corollary. 

Corollary 3. Each class Y? k , W p k of the polynomial hierarchy contains an RR problem 
that complete for a class under -reductions ( and under polynomial reductions). 



5 Universality of generalized nondeterministic models 

RR problems are closely related to models of generalized nondeterminism (GNA) in- 
troduced in J3). GNA classes are parametrized by languages of infinite words (certifi- 
cates). It was shown in [ 1 ] that each GNA class contains RR problem complete for the 
class under ^J° s -reductions. A filter of RR problem is formed by prefixes of certificates 
for the GNA class. 
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Note that filters in the proof of Theorem |2] are not prefix closed. So they do not 
correspond any GNA class. To prove universality of GNA classes we need a more 
sophisticated construction. 

What RR problems correspond to GNA classes? To be prefix closed is a necessary 
condition only. The second condition is the following: each filter word is a proper 
prefix of a filter word. These two conditions guarantee that the filter is the prefix set 
for some set of certificates. 

To satisfy the second condition a very simple modification of a filter is needed. 

Proposition 12. If L C £*, # $ £ then 

RR(L) <£ g RR(L#*) ^|° s RR(L). 

Proof The language L is a regular restriction of We get the first reduction from 
this observation and [ 1 , Lemma 4]. 

Now we construct the second reduction. Let A be an instance of the problem 
RR(i#*). To output an instance B of RR(L) the reducing algorithm changes the 
set of accepting states of DFA A only. A state q is accepting for the automaton B iff 
Sa(q, is an accepting state of A for some k. 

To check correctness of this reduction take a word in the form w# k , w £ LPiL(A). 
Then B accepts w by construction. In the other direction: if B accepts w G L then after 
reading w the automaton A moves to a state such that an accepting state is reachable 
by reading a sequence of 

Note that RR(#*) e LOG [1, Corollary 1]. Using this observation a log space 
algorithm for the second reduction can be easily constructed. □ 

Thus for any RR problem with a prefix closed filter there exists an equivalent RR 
problem with a filter that coincides with the prefix set of a language of infinite words. 

So universality of GNA classes follows from the following generalization of Theo- 
rem |2] 

Theorem 3. For any nonempty language X there exists a prefix closed filter P such 
that 

To prove Theorem[3]we follow the same plan as for Theorem |2] 
Again we use monoreductions. We choose the filter P in the form XL for some 
map / and some set X' which is ^|° s -equivalent to X. Actually we will use X' = 

OlU{e}. 

To make the filter P prefix closed we choose an appropriate map /. Namely, we 
require that / maps words in the form 0{0, 1}* to a prefix code D (i.e. a set of words 
such that no word in the set is a prefix of an another word in the set). Also we require 
that / maps all nonempty words to the set D' = {w : w = uv, u e D} of all possible 

extensions of D words. Then f(X') is suffix closed while P = Xj — f(X') is prefix 
closed. 

By construction P contains £)t. So for an instance A of the RR problem RR(P) 
the answer is positive if L(A) \ ^ 0. The latter condition plays now a role of 
infiniteness condition in the proof of Theorem|2] 



14 



To keep arguments from the previous section a check of L(A) \ ^ should 
belong to NLOG and generation of the list containing all prefixes of words from L(A)D 
D should be done by a FNL algorithm provided L(A) C D', 

Now we elaborate details of this plan. 

We use the set D from the previous section as a prefix code (see Corollary |2). 
The set D' is not regularly immune. But in the crucial case L(A) C D' (see the 
above plan) the Z?-prefix set 

Rd = {w : w £ D and wv £ L(A)} 

is finite. Below we modify algorithms from Section[T]to deal with the sets of this form. 
Now we upperbound the cardinality of Rd and lengths of words in Rd- 

Proposition 13. If L(A) c D^, where A is DFA with the state set Q, \Q\ = n, and 
w £ Rd then 

\w\ < 2(n - 3) + 4\/^ — 3 + 10. 
Proof By definitions a word w G D has a form 

I3(x)l 2 e2+3 l3(x)l 2 £2+3 , 

where I = \x\ and wz £ L(A) for some z. 

If £ 2 + 3 ^ n then while reading the word +3 the automaton visits some state at 
least two times. Thus for some integer k > and all integers i > Owe have 

wt = f3(x)l 2 e2+3+ * k (3(x)l 2 e2+3 z £ L(A). 

Note that for i > a word Wi does not have a D prefix in contradiction with the 
hypothesis L(A) C D'. Indeed such a prefix must have a prefix (3(x)l 2 and due to 
Proposition|6]must coincide with the word w. But w is not a prefix of W{ for i > 0. 

Thus I 2 + 3 < n. The length of w equals 2£ 2 + it + 10. It implies the required 
inequality. □ 

To upperbound the cardinality of Rd we modify PropositionfTTI 

Proposition 14. Let L(A) C D^. For any coaccessibl^ state t, integer k and a 
state m there exists at most one word uu £ D such that \u\ — k and Sa(s, u) = rn, 
Sa{tti, u) = t. 

Proof. Suppose there are two words uu and vv satisfying these conditions. Then 
uvw £ L(A) for some w. From Proposition [6] we conclude that a word uvw does 
not contain D prefixes in contradiction with the hypothesis L(A) C D'. □ 

Corollary 4. IfL(A) c then \R D \ < 2|Q| 3 (1 + o(|Q|)). 

Correctness of the algorithm from Lemma|5]is based on Proposition[TT] Replacing 
it by Proposition[l4]we get the following lemma. 

4 'A coaccessible state is a state from which it exists a path to an accepting state. 
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Lemma 6. There exists a map f G FNL that generates the element list o/sq 1 (Rd) 
provided L(A) C . 

Proof. The algorithm mimics the algorithm from Lemma |5]but for guessed squares it 
tests a membership to prefixes of L(A) (instead of a membership to L(A) itself). The 
prefix set of L(A) is recognized by DFA A p having the same state set and the transition 
table as DFA A. Accepting states of A p are coaccessible states of A. 

It is possible to check coaccessibility on nondeterministic log space. So a FNL 
algorithm is able to apply this test. 

Correctness of the algorithm follows from Proposition [14] □ 

Lemma 7. Testing L(A) C is in NLOG. 

Proof. The statement is equivalent to RR(ZJT) e NLOG due to equality NLOG = 
co-NLOG. 

The algorithm is similar to the algorithm from Lemma 2] 

Let DFA A with the state set Q be an instance of RR(£)t). The algorithm guesses 
a word from P\L(A) and verifies this condition. It uses log space and reads an input 
in one way. 

To detect a word from £)t n L(A) the algorithm guesses a prefix of a word w £ 
L(A) certifying that to ^ DL 
At first a prefix in the form 

i?(a)(eU{0,l}), N=5|Q|, (12) 

guarantees that L(A) \ ^ due to Proposition[T3l 
Note that each language from the following list 



/3(x)00, 








(13) 


/3(x)110 l l, 




i < \x\ 2 - 


1-3, 


(14) 


/3(x)110\ 




i > \x\ 2 - 


1-3, 


(15) 




h3 /3(y)00, 






(16) 


/8(s)110 |a,a - 


h3 /%), 


M < 




(17) 




h3 /3(y)n, 


\A > \v\, 




(18) 




ha ^(y)non, 


i < \y\ 2 - 


-3, 


1*1 = \y\, d9) 






i > \y\ 2 - 


-3, 


\x\ = \y\. (20) 



do not contain neither words having a prefix from D nor prefixes of words from D. 

Also, each condition dT2l420b is recognized by a log space algorithm reading an 
input in one way provided the length of a word is less then 10|Q(j4)| + 1. 

With the same restrictions it is possible to verify that the language L(A) contains a 
word to such that to is a prefix of a word from a language presented in the list (TT2~l - l20l >. 
More exactly, for (l20b we exclude prefixes in the form 

(3(x)110^ 2+3 (3(y)llQ^ 2 + 3 , \x\ - \y\. (21) 
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If L(A) do not contain any word satisfying the above requirements then each word 
from L(A) has a prefix in the form (|2TV In this case the algorithm should indicate a 
prefix with x 7^ y. If such a prefix does not exist we have L(A) C D', 

To indicate a prefix in the form (fJT) with x 7^ y the algorithm guesses two states 
q, t such that t is coaccessible (and checks coaccessibility by consulting with NLOG 
oracle). 

The algorithm also guesses states 

qi = S A (s,f3(p)), q 2 = 5 A (q,P(p)), (22) 

where p is the greatest common prefix of x and y. Then it simulates reading (3(p) 
starting from states s and q keeping the number of guessed symbols in a counter. It 
continues by reading 01 in one case and by reading 10 in the another (to guarante 
that x 7^ y). After that the algorithm guesses two words separately verifying for both 
of them the format ( f2TT > and counting their lengths. When guessing is finished the 
algorithm checks that the above conditions (l22l and compares the lengths of words. □ 

Now we present a map sq^ to construct the second reduction in (flTt : 

sq 1 " 

sq T : Ox t-> sq(x), (23) 
sq 1 ": Ix ^ sq(7ri(^))7T2(a;), 

where ir(x) = (tti(x), 7r 2 (x)) is abijection {0, 1}* onto {0, 1}* x ({0, 1}* \ {e}) such 
that 7r, 7T _1 are log space computable. 

It immediately follows from definition that sq^ is injective and Im(sq^) = U 
{e}. It easy to see that sq^ and sq^ are log space computable. 

Note that a bijection it in (l23l can be easily constructed from a map 

ip: (x,ay) ^ /3(x)aay, (24) 

which sends {0, 1}* x ({0, l}*\{e}) to a set S consisting of words of the length greater 
than 3 and words 00, 11. Namely, ir(x) = ^ 1 {l(x)), where l is an injective map to S 
such that both l and are log space computable. 

For the sake of completeness we presented a construction of such a map. 

Lemma 8. Let S be a cofinite set of binary words, i.e. \S\ < 00. Then there exists an 
injective map fs such that domain of fs is the set of all words, Im(/s) = S and fs, 
fg 1 are log space computable. 

Proof. Choose a set of binary words M such that all words in M have the same 
length n, \M\ = \S\ and 

n > max \x\. 

xes 

Then choose a bijection xr: S — > M. Now a map fs in question can be defined as 
follows 

fs '. x 1 v lx(x), for ieS, 
/sifflm^O'+'lm, for q G M, 
fs '■ x h- ^ x otherwise. 
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It is obvious that fs satisfies all requirements. 



□ 



Proof of Theorem\3\ It is observed above that the language X is equivalent to the lan- 
guage X' — OX U {e} under ^J^ s -reductions. From definition (l23l the map sq^ sends 
nonempty words from X' to the set D while the images of all nonempty words co- 
incide with D^. It means that sq^(X') is suffix closed, i.e. w € sq^(X') implies 
wv € sq^(X') for all v, and X' s t = sq^(X') is prefix closed. 
Therefore it is sufficient to show that 

RR(X> sq ,)^Xi. 

The reduction is constructed using Lemmata [6] and [7j in a way similar to the proof of 
Theorem 12 

Using Lemma [7] the reducing algorithm checks L(A) c for an automaton A 
which is an instance of the problem RR(X^ qT ). If L(A) <f_ ZF then the answer for this 
instance is positive and the reducing algorithm outputs a trivial query list containing 
the fixed word from X' t . Otherwise the reducing algorithm invokes the procedure 
from Lemma[6]to generate the element list of sq _1 (Rd)- It follows from construction 
of the set X' s t that L(A) n X' ^ ^ iff xj € X' for some j. □ 

Taking into account normality of classes in the polynomial hierarchy we get the 
following corollary. 

Corollary 5. For each k the classes S?, II^. contain RR problems with a prefix-closed 
filter that are complete for a class under polynomial reductions. 
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